AITopics | mutual information loss

Computing approximate nearest neighbors in high dimensional spaces is a central problem in large-scale data mining with a wide range of applications in machine learning and data science. A popular and effective technique in computing nearest neighbors approximately is the locality-sensitive hashing (LSH) scheme. In this paper, we aim to develop LSH schemes for distance functions that measure the distance between two probability distributions, particularly for f-divergences as well as a generalization to capture mutual information loss. First, we provide a general framework to design LHS schemes for f-divergence distance functions and develop LSH schemes for the generalized Jensen-Shannon divergence and triangular discrimination in this framework. We show a two-sided approximation result for approximation of the generalized Jensen-Shannon divergence by the Hellinger distance, which may be of independent interest. Next, we show a general method of reducing the problem of designing an LSH scheme for a Krein kernel (which can be expressed as the difference of two positive definite kernels) to the problem of maximum inner product search.

locality-sensitive hashing, mutual information loss, name change, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

71a58e8cb75904f24cde464161c3e766-Paper.pdf

Neural Information Processing SystemsOct-3-2025, 05:31:33 GMT

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Zero-Resource Knowledge-Grounded Dialogue Generation Linxiao Li Peking University

Neural Information Processing SystemsOct-3-2025, 01:19:03 GMT

To this end, we propose representing the knowledge that bridges a context and a response and the way that the knowledge is expressed as latent variables, and devise a variational approach that can effectively estimate a generation model from a dialogue corpus and a knowledge corpus that are independent with each other.

arxiv preprint arxiv, machine learning, natural language, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond

Lin Chen, Hossein Esfandiari, Gang Fu, Vahab Mirrokni

Neural Information Processing SystemsOct-2-2025, 08:59:06 GMT

Neural Information Processing Systems http://nips.cc/

data mining, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.96)
Information Technology > Artificial Intelligence > Natural Language (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

address each reviewer's specific questions in turn. 3 Reply to R1

Neural Information Processing SystemsAug-20-2025, 02:23:33 GMT

We thank all three reviewers for their detailed and thoughtful reviews. "How about if the slopes differ?" Per your feedback, we ran new experiments where the slopes differ. "Do the players learn from previous experience?" We do not model the player's learning but plan to in future work.

baseline, classification accuracy, mutual information loss, (13 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (0.50)

Technology: Information Technology > Artificial Intelligence > Games (0.31)

Add feedback

Learning Obfuscations Of LLM Embedding Sequences: Stained Glass Transform

Roberts, Jay, Mylonakis, Kyle, Roy, Sidhartha, Kale, Kaan

arXiv.org Artificial IntelligenceJun-12-2025

The high cost of ownership of AI compute infrastructure and challenges of robust serving of large language models (LLMs) has led to a surge in managed Model-as-a-service deployments. Even when enterprises choose on-premises deployments, the compute infrastructure is typically shared across many teams in order to maximize the return on investment. In both scenarios the deployed models operate only on plaintext data, and so enterprise data owners must allow their data to appear in plaintext on a shared or multi-tenant compute infrastructure. This results in data owners with private or sensitive data being hesitant or restricted in what data they use with these types of deployments. In this work we introduce the Stained Glass Transform, a learned, stochastic, and sequence dependent transformation of the word embeddings of an LLM which information theoretically provides privacy to the input of the LLM while preserving the utility of model. We theoretically connect a particular class of Stained Glass Transforms to the theory of mutual information of Gaussian Mixture Models. We then calculate a-postiori privacy estimates, based on mutual information, and verify the privacy and utility of instances of transformed embeddings through token level metrics of privacy and standard LLM performance benchmarks.

information, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2506.09452

Country:

Asia (0.46)
North America > United States (0.14)

Genre: Research Report (0.83)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Reviews: Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond

Neural Information Processing SystemsJan-22-2025, 06:46:24 GMT

The paper presents locality-sensitive hashing schemes for well-studied distance function between probability distributions. The new schemes are based on the ideas. The first one is to approximate the distance function of interest by another distance function for which LSH schemes are known. In particular, the paper shows how to approximate MIL divergence and triangular discrimination by the Hellinger distance, for which LSH schemes are known. The second is specific to the MIL divergence, and involves representing the latter distance function as a so-called Krein kernel, and designing an asymmetric LSH scheme.

distance function, locality-sensitive hashing, probability distribution, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.37)

Add feedback

Filters

Collaborating Authors

mutual information loss

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond

cc70903297fe1e25537ae50aea186306-AuthorFeedback.pdf

609c5e5089a9aa967232aba2a4d03114-Paper.pdf

Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond

71a58e8cb75904f24cde464161c3e766-Paper.pdf

Zero-Resource Knowledge-Grounded Dialogue Generation Linxiao Li Peking University

Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond

address each reviewer's specific questions in turn. 3 Reply to R1

Learning Obfuscations Of LLM Embedding Sequences: Stained Glass Transform

Reviews: Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond